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Abstract —As the capacity demand of mobile applications keeps 
increasing, the backhaul network is becoming a bottleneck to 
snpport high quality of experience (QoE) in next-generation 
wireless networks. Content caching at base stations (BSs) is a 
promising approach to alleviate the backhanl burden and reduce 
user-perceived latency. In this paper, we consider a wireless 
caching network where all the BSs are connected to a central 
controller via backhaul links. In such a network, users can obtain 
the required data from candidate BSs if the data are pre-cached. 
Otherwise, the user data need to be first retrieved from the central 
controller to local BSs, which Introduces extra delay over the 
backhaul. In order to reduce the download delay, the caching 
placement strategy needs to be optimized. We formulate such 
a design problem as the minimization of the average download 
delay over user requests, subject to the caching capacity constraint 
of each BS. Different from existing works, our model takes BS 
cooperation in the radio access into consideration and is fully 
aware of the propagation delay on the backhaul finks. The design 
problem is a mixed Integer programming problem and is highly 
complicated, and thus we relax the problem and propose a 
low-complexity algorithm. Simulation results will show that the 
proposed algorithm can effectively determine the near-optimal 
caching placement and provide significant performance gains over 
conventional caching placement strategies. 

Index Terms —Wireless caching networks, caching placement, 
download delay 

I. Introduction 

The explosive growth of global mobile data traffic IT), espe¬ 
cially mobile video streaming, has led to significant increases 
in user latency and imposed a heavy burden on backhaul links 
that connect local BSs to the core network. The congestion in 
the backhaul may cause excessively long delays to the content 
delivery, and thus degrades the overall quality of experience 
(QoE). In order to support the increasing mobile data traffic, 
one promising approach to reduce delivery time and backhaul 
traffic is to deploy caches at BSs, thereby bringing frequently 
requested bulky data (such as videos) close to the users m 
in the hope of satisfying user requests without increasing the 
burden over backhaul links. In this way, the backhaul is used 
only to refresh the caches when the user request distribution 
evolves over time. Usually, the refreshing process does not 
require high-speed transmission and can work at off-peak times. 

Caching capacity of local BSs can be regarded as a new 
type of resources of wireless networks besides time, frequency 
and space. However, the caching capacity is limited compared 
with the total amount of mobile traffic. Thus sophisticated 
caching placement strategies will be needed to fully exploit the 
benefit of caching. With the knowledge of channel statistics 


and file popularity distribution, the central controller is able to 
determine an optimal caching strategy to cater for user requests 
with locally cached content. Once caches are fully utilized, 
the requirements for backhaul can be greatly reduced and the 
download delay can be shortened, especially when the backhaul 
links are in poor conditions. 

So far, the design of caching placement has not been fully 
addressed. Most of the previous studies fail to take physical 
layer features into consideration. For example, it was assumed 
in |3| and SI that the wireless transmission was error-free. 
In |5l, delays of D2D and cellular transmissions were simply 
set as constants and the proposed caching strategy was to 
store as many different files as possible. However, when taking 
multipath fading into consideration, storing the same content at 
multiple BSs can actually provide channel diversity gains and 
is perhaps more advantageous. The authors in |6l analyzed both 
uncoded and coded femto-caching in order to minimize the total 
average delay of all users. In their work, coded femto-caching 
was obtained as the convex relaxation of the uncoded problem. 
Nevertheless, it not only ignored physical layer features, but 
also imposed a certain network topology requirement which 
cannot be fulfilled in practice. Physical-layer operation includ¬ 
ing data assignment and coordinated beamforming in caching 
networks was considered in Q, but the caching placement was 
given as a prior. 

There are also works studying the dynamic caching place¬ 
ment and update. In |8l, the authors studied video caching 
in the radio access network (RAN) and proposed caching 
policies based on the user preference profile. Nevertheless, 
they considered neither the variation of the wireless channel 
during the transmission of a file nor the actual delay of 
wireless transmission and backhaul delivery. The authors of |3 
concentrated on the caching content optimization in a single BS. 
The file popularity was assumed unknown and their strategy 
was optimized based on the observation of user request history 
over time. However, this work did not consider the effect of 
backhaul delays and assumed that the cache replacement was 
of negligible duration and operated frequently. 

In this paper, we present a wireless caching network model to 
determine the optimal caching placement strategy for managing 
random user requests. In particular, we aim at minimizing the 
average download delay, which is one of the key QoE metrics, 
by taking wireless channel statistics into account. Moreover, to 
the best of our knowledge, the impact of backhaul delays on 
caching placement is studied in this paper for the first time. 


The design of the caching placement strategy is formulated 
as a mixed-integer nonlinear programming (MINLP) problem, 
which is computationally difficult to solve. Thus we resort to 
the relaxed formulation of the problem and provide a low- 
complexity algorithm. Simulation results show that the strategy 
derived from our proposed algorithm outperforms other well- 
known caching placement strategies. Furthermore, we provide 
some insights into the caching placement design. Specifically, 
in the case where the backhaul delay is very small, the most 
popular content has a higher priority to be cached. On the other 
hand, when the backhaul delay is relatively large, it should 
be encouraged to maximize the caching content diversity, i.e., 
to cache as much different content as possible, to reduce the 
chance of invoking backhaul transmission. 

II. System Model 

In this work, we consider the downlink of a wireless caching 
network, in which K single-antenna BSs and U single-antenna 
mobile users are uniformly distributed in the considered region. 
Through backhaul links, the BSs are connected to a central 
controller, which also acts as a file library server. The library 
contains F files and each of them can be divided into L 
segments of equal size (with b bits). Each BS is equipped with a 
storage unit of limited capacity. For simplicity, we assume that 
all users see channels with the same distribution. Without loss of 
generality, we focus on one user for calculating the performance 
metric of interest. A BS will be regarded as a candidate BS for 
user i if it holds the content requested by user i, no matter 
whether such content is previously cached or retrieved via the 
backhaul. User i has its cluster of candidate BSs, denoted as 
which consists of BS indices and has cardinality |$i|. The 
system model and data retrieval process are illustrated in Fig. 
[T] The user will choose the BS of the best wireless channel in 
to communicate with, as shown in Fig. [T] (a). As for the 
case that no BS holds the requested content, as shown in Fig. 
[U (b), the central controller will pass such content to all BSs 
and we shall have |$i| = K. 

A. File Transmission Model 

We consider segmented file transfer Eol (also known as 
multi-source file transfer), which has the advantage of allowing 
a requested file to be sourced and downloaded from different 
BSs in various time slots. Different segments of a requested file, 
when reaching the user, will be decoded and assembled into a 
complete file. Each segment can be as small as a single packet 
or as big as a stream lasting for several minutes. We will first 
discuss how to calculate the delay of downloading a segment, 
which is typically quantified as the average number of required 
time slots for a segment to be successfully decoded. Under the 
assumption that the segments of a file are subsequently sent 
over homogenous and ergodic channels, the download delay of 
this file is indeed the sum of all segment-download delays. 

We shall assume that time is partitioned into r-second-wide 
time slots indexed by s. We will also assume that at each time 
slot, user i communicates with the BS in <i>i that provides the 
highest signal-to-interference-plus-noise ratio (SINR), and this 
BS is able to transmit N ^ 1 complex symbols. Let hi [s] € C 




(b) 

Figure 1. System model and content retrieval process, (a) Content retrieval 
from only local BSs, <I>i = {2, 4}; (b) Content retrieval from the central 
controller with extra backhaul delay, <!>; = {1, 2, 3, 4}. 


denote the channel coefficient between the selected BS and user 
i, following the same distribution for all i and s. The channel 
is assumed to be block-fading with block length r, i.e., the 
channel is constant for a duration of r seconds and different 
channel block realizations are i.i.d.. At time slot s, the transmit 
signal from the selected BS to user i is denoted as 

ai* [s] = (Tji [s] ,... [s]) (1) 

and the transmit power constraint is E | \xin [s] |^ | < Pt for all 
i G {1,..., U}, n G {1,... ,N} and s. 

The received signal of user i for time slot s can be written 
as 


Vi [s] = [s] Xi [s] -f n, [s], (2) 

where y, [s] = {yn [s],..., [s]) and n, [s] = 

(nil [s] 5 ■ • • j iiiAT [s]). The noise is complex white Gaussian, 
i.e., rii [s] ~ CN (O, cr^). 













The transmission of a segment to the intended user, in 
accordance with an incremental redundancy (IR) hybrid-ARQ 
protocol as adopted in LTE ifTTl . will be presented below. Given 
a well-designed Gaussian codebook, the 6-bit file segment is 
encoded into a channel code c = {ci| C 2 I ... |ct}, where T is 
a given positive integer, which can be made arbitrarily large, 
and each subcode Cj, containing N complex symbols, can be 
individually used to recover the original file segment. These 
subcodes are further modulated into a series of signal bursts. 
At each time slot, one signal burst is allowed to be sent. If this 
burst is not decoded correctly, the user feeds back a negative 
acknowledgment (NACK) message over an error-free and low- 
latency channel. Once the BS receives this NACK message, the 
next burst is sent at the next time slot. This process continues 
until the BS receives an acknowledgement (ACK) message. If 
the transmission starts from the first time slot and ends at the S- 
th time slot, the effective coding rate is R/S, where the coding 
rate is defined as R = b/N bits/sec/Hz as a complex symbol 
can be transmitted approximately in 1 s and 1 Hz ifT^ . 


B. Caching Placement Strategy 


In practice, the caching capacity at BSs cannot be arbitrarily 
large. Thus, it is crucial to optimize the caching strategy to 
exploit the limited caching capacity in order to maximize the 
benefits brought by caching. 

We denote the caching placement strategy as a three- 
dimensional matrix C = [Ci,... Ck, ■ ■ ■, Ck] S {0, 
consisting of binary entries, where Ck, indicating the caching 
placement of BS k, is defined as 


A 




G{0,1} 


FxL 


(3) 


in which c^i = 1 means that segment I of file / is cached at 
BS k in advance and Cj; = 0 means the opposite. Since each 
BS has a limited caching capacity C, we have 


F L 


/=! 1=1 


yke{i,...,K}. 


(4) 


When user i requests segment I of file /, the number of 
candidate BSs holding the segment is 


(5) 

k^l 

Note that there should be a constraint Q< Nfi<K for Mf & 
{!,...,F},VZ G {l,...,L}in order to avoid duplicate caching 
for the same segment at a BS. 


III. Backhaul-Aware Caching Placement 

The average download delay is a representative metric for 
the system performance in wireless caching networks. In this 
section, we will first derive an analytical expression of the 
download delay. We will then formulate the problem of min¬ 
imizing the download delay subject to constraints on caching 
capacities. 


A. Download Delay 

We assume Rayleigh fading channels and no interference 
among different users, e.g., they can be served by different 
subcarriers with orthogonal frequency-division multiple access 
(OFDMA). Denote the signal-to-noise ratio (SNR) from BS k 
to user i as SNRJ', and the probability density function (PDF) 
of SNR^ is given by 



where p is the average received SNR and we have p = Ptja^. 
According to our previous assumption, at time slot s, the 
received SNR of user i can be obtained as 


Pi = max SNRf. (7) 


Hence, the PDF of pi is 


Ppi (x) = exp 






( 8 ) 


For simplicity, we omit the subscript i in the following 
derivation. 

In the IR scheme, each user has a buffer with size m. Hence, 
up to m of the most recent signal bursts can be stored and 
used to decode the information. In practice, m is chosen to 
reach a compromise between the decoding performance and 
the implementation cost. If the buffer is big enough, m can 
be regarded as infinity. If only the current burst is used for 
decoding, we will have m = 1. It has been indicated in IfT^ 
that the mutual information across multiple time slots can be 
written as 


R[s]= log 2 (1 +Pb']) . (9) 

i=(s —m)"*" + l 

where p [j] is the received SNR at time slot j. When employing 
typical set decoding, the probability of decoding error for user 
i at time slot s can be expressed as 


q[s]=Pr{R[s]<R}. (10) 


It is difficult to obtain a closed-form expression for the average 
download delay. Hence, we shall derive a lower bound. 


Theorem 1. For the IR hybrid-ARQ protocol, the average 
download delay of a segment in this system model is lower 
bounded by 


E > 


1 


1 - 





D / \ \ ’ 

^)) 


Proof: For a given user, the probability that its download 
delay of a segment (6 bits) is larger than d time slots is 
given by 


Pr{D^^^ > d}=Pr{R[s] < R, for s = 1,2,...,d} 


( 11 ) 






The expected download delay for a segment can be obtained as 


+ cx:) 


> d} 


d=0 


1 


l-q[s] 

From (|9ll and (fTOl l. we can get 


( 12 ) 


g[s]=Pr<( log 2 (1+ p[j]) < i? 

^:/ = (s-m) + + l 


(13) 


Note that if log 2 (1 + p [j]) < ^ for Vj S {1,..., s}, we have 
Z]^=(s-m)++i log 2 (1 + P) < R- As a result, we can get 


q [s] > Pr log 2 (1 + p [j]) < for Vj 


= Pm log2 (1 + p) < 


R 


(14) 


With dSl), we can obtain the lower bound of the error probability 
as 


9 [s] > 1 - exp - 


^Rlm _ ^ 

P 




(15) 


Substituting (fTsT i into (fTSl i yields the desired result. ■ 

Theorem [T] implies the download delay for a segment is a 
function of |$|, R, m and p. We will adopt the lower bound 
when calculating the average download delay of a segment, 
which is given by 


-D(|$i| ,i?,m,p) 


1 


1 - 






(16) 


Note that if the requested segment has not been cached in 
any BS, this segment will hrst be delivered to all K BSs 
via backhaul and then the wireless transmission will follow 
the same scheme as the above-mentioned cached segment 
downloading. In that case, the number of candidate BSs is K, 
i.e. , |$| — K, and an extra backhaul delay will be caused, 
which is denoted as S. 


B. Problem Formulation and Relaxation 

There are two cases when determining the candidate BSs. If 
^ 0, we have l^l = Otherwise, when N^i = 0, based 
on our assumption, we have |$| = iT and the segment download 
delay is D {K, R, m, p) -f <5. Accordingly, the download delay 
of hie / can be calculated as 

L 

Y{D{N%R,m,p)-\{Nfi^Q) 

1^1 

+ [D{K,R,m,p) + 5]-l{Nfi = 0)) . (17) 

We suppose that the hie / is requested with probability Pf and 
thus Pf = 1- The requests for segments of the same hie 

are of equal probability. Therefore, the average download delay 
of all the hies can be written as 


F L 

fo {C, R, m,p) = YPfT. P) ■ 1 ^ 0) 

/=! 1 = 1 

+ [DiK,R,m,p)+S]-l{Nfi=0)}. (18) 

Our goal is to minimize the average download delay by 
arranging the placement of segments at each BS subject to 
the caching capacity limit, given physical layer constraints, 
including the coding rate, the buffer size at users and the 
received SNR target. With hxed R, m and p, the caching 
placement problem is formulated as 


’0 : minimize 
c 

/o {C,R,m,p) 

F L 

(19) 

subject to 

<C,VfcG{l,. 

/-I ^-1 

K 

..,K} (Cl) 


> 

VI 

VI 

0 

(C2) 


C G 

(C3) 


where constraint Cl stands for the caching capacity limit of 
each BS and constraint C2 indicates that each segment can be 
cached by at most K BSs. It turns out that problem is 
an MINLP problem and thus it is highly complicated to hnd 
the optimal solution. As a result, we will focus on developing 
effective sub-optimal algorithms. 

By further examining problem 3^^, we hnd that the optimiza¬ 
tion of caching placement boils down to the determination of the 
number of candidate BSs for each segment. In order to simplify 
the notations, we dehne it as a vector x = [xi] G where 

= N^i = 'Yk=i^%- ^6all take two caching 
strategies as an example. That is. 



1 0 


1 1 


0 

1—1 



1 0 

1 

0 0 

5 

1 0 



1 0 


1 0 


1 1 



1 0 

1 

1 0 

5 

0 0 



If Cl is the optimal caching strategy, then C 2 is also optimal. 
This is because both Ci and C 2 correspond to the same vector 
X = [3,1, 2,0]^, which determines the average download delay 
of the two strategies. Therefore, (O can be written as 

= (20) 


with 


/3 = 


^1 — exp 



( 21 ) 


For Xi > 0, we can hnd that D (xi) is convex w.r.t. Xi for 
ViG{l,...,FL}. 

For each segment Xi, for Vi G {1, ■ • ■ ,FL}, its download 
delay is 


D {xi) • 1 (x, ^ 0) + [D {K) + j] • 1 (x, = 0). (22) 

The indicator functions in (l22li will cause the major difficulty 
in designing the caching placement strategy. To resolve this 

















issue, we adopt an exponential function with 0 < a < 1 
to approximate the indicator function 1 {xi = 0). Then, we can 
obtain an approximated function / (x) to represent the average 
download delay of a file as 

FL 

f (x) = ^ {D (x.) • (1 - + [D {K) + S] ■ 

= h (x) + /2 (x), (23) 

where fi (x) and /2 (x) are given by 

FL 

h (X) = Pp^^ {D {x.) + [D (K) + (5] ■ (24) 


/ 2 (x) = -y]Pp^^P(:r,)a"-. (25) 

i=l 

We can find that fi (x) is convex w.r.t. x and /2 (x) is concave 
w.r.t. X, which means that / (x) is the difference of the convex 
(DC) functions. 

With a fixed (3, we consider an approximated problem 
instead of That is, 

: minimize / (x) (26) 

X 

FL 

subject to Xi < KC 

i=l 

Q<Xi<K, Vz G {1,...,PP} 

X G 

If we relax the integer constraint first, problem turns out 
to be a DC programming problem, which is not easy to solve 
directly due to the non-convex smooth objective function / (■). 
The successive convex approximation (SCA) algorithm m 
can circumvent such a difficulty by replacing the non-convex 
object function with a sequence of convex ones. Specifically, 
by starting from a feasible point x(°\ the algorithm generates 
a sequence according to the update rule 

x(*+i)=x(‘)+Z 7 (‘)(x-x(‘)), (27) 

where -xF is the point generated by the algorithm at the f-th 
iteration, 77 ^*^ is the step size for the f-th iteration, and x is the 
solution of a convex optimization problem 

(28) 

FL 

^x,<KC 

i=l 

Q < Xi < K, Vz G {1,... , FL} . 

g is an approximation of / (x) at the (f+l)-th 

iteration, which is defined as 


2 : minimize 

X 

subject to 


g (^x, x^*)) = fi (x)+V /2 (^x - x^*) 


+T 



Algorithm 1 : The SCA Algorithm 

Initialization: Find a feasible point x(°) and set t = 0. 

Repeat 

Solve problem JS and obtain the solution x; 
Update x(*+^) = x^*) + (x — x^*)); 

Set f = f + 1; 

Until stopping criterion is met. 


and is a tight convex upper-bound of / (x). The main steps 
of the algorithm are presented in Algorithm [T] The solution 
obtained from Algorithm [T] then should be rounded due to the 
constraint x G 

IV. Simulation Results 

In this section, we present numerical results to examine the 
performance of the proposed caching placement strategy and to 
investigate the impact of backhaul delay. 

Some previous studies have shown that in practical networks 
the request probability of content can be fitted with some popu¬ 
larity distributions. In this work, we assume that the popularity 
of files follows a Zipf distribution with parameter 7 ^ = 0.6 (see 
IH) and the files are sorted in a descending order in terms of 
popularity. We set the rate target as R = 2.5 bits/sec/Hz and 
the average received SNR as p = 10 dB. We consider the case 
where only the current burst is used for decoding at the user 
side; i.e., m = 1. The range of the backhaul delivery delay S is 
selected on the basis of a measurement operated on a practical 
network, as was done in fS) . Their experiment implied that the 
backhaul delay of a piece of content approximately ranges from 
30% to 125% of its wireless transmit delay. To investigate the 
impact of such delay, we choose S G [0,4]. 

First we compare the performance of the proposed algorithm 
with exhaustive search. The file library has three files and each 
of them is divided into three segments. There are four BSs 
and each has a capacity of two segments. We adopt a step 
size of zy = 1 and the stopping criterion idistributions given 
by — x^*) II 2 /||x(‘) II 2 < 10“"^. Fig. |2] shows that the 

results given by the proposed efficient algorithm are very close 
to those obtained using the exhaustive search. It can also be 
observed that a slight increase in cache size C can significantly 
reduce the download delay, which confirms that caching is of 
great potential to enhance future wireless networks. 

Next, we compare the proposed algorithm with two standard 
caching placement strategies on a large-scale system, where 
there are 50 BSs, each with a capacity of 10"^ segments. The 
file library has 10 ^ files, each of which is divided into 10 ^ 
segments. One standard strategy stores C segments of the most 
popular content (MFC) ID in the cache memory of each BS. 
The other one always places KC different segments in total 
at all BS caches, which aims at the largest content diversity 
(LCD) 13]. For the MFC policy, xi = 2:2 = • ■ • = Xk = C 
and Xk+i = ■ • • = xfl = 0; and for the LCD policy, 
xi = X2 = • ■ • = Xjic = 1 and Xfc+1 = • • ■ = xfl = 0. 
The MFC policy is often adopted when multiple users access a 
few pieces of content very frequently, such as popular movies 
and TV shows. The LCD policy ensures that for most requests. 










Figure 2. Average download delay versus backhaul delivery delay. 



Figure 3. Performance comparison of various caching placement strategies. 

part of the segments can be served by the local caches, instead 
of having to be fetched from the remote central controller 
through the backhaul. The step size and the stopping criterion 
are the same as Fig. |2l Fig. [3 demonstrates that our proposed 
strategy outperforms the other schemes, and important insights 
are revealed. When the backhaul delay is small enough, it 
can be regarded as equivalent to the case of inhnite caching 
capacity. In this case, the best way to save download time 
is to maximize the channel diversity for each segment. As a 
result, the proposed strategy and the MFC policy coincide at the 
point 6=0. When the backhaul delay increases, the advantage 
of caching diversity emerges, and the LCD policy will surpass 
the MFC policy. Our proposed strategy and the LCD policy 
will converge when backhaul links are suffering from severe 
delivery delays. This is because when there is a large 6, even 
a single delivery via backhaul will lead to a huge delay and 
thus backhaul transmission should be prevented as much as 
possible. Therefore, the strategy that can provide maximum 
caching content diversity is favorable. 

V. Conclusions 

This paper presented a framework to minimize the aver¬ 
age download delay of wireless caching networks. A caching 
placement problem which takes into account physical layer 


processing as well as backhaul delays was formulated to 
fully exploit the beneht of caching. As the design problem 
is an MINLF problem, we relaxed it into a DC optimization 
problem and adopted the SCA algorithm to solve it efficiently. 
Simulation results showed that our strategy can signihcantly 
reduce the average download delay compared to conventional 
strategies, and the proposed low-complexity algorithm can 
achieve comparable performance to exhaustive search. More¬ 
over, we demonstrated that the backhaul propagation delay 
will greatly influence the caching placement. Specihcally, when 
the backhaul delay becomes very small or very large, our 
proposed strategy will gradually evolve to the MFC and the 
LCD strategy, respectively. In particular, for a practical value 
of the backhaul delay, the proposed caching placement serves 
as the best strategy. Therefore, it can be concluded that our 
work provides a promising model to formulate the download 
delay for wireless caching networks, and important insights are 
given for determining the optimal caching placement strategy 
under different backhaul conditions. 
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